CDS

Accession Number TCMCG075C00964
gbkey CDS
Protein Id XP_017970644.1
Location join(4357257..4357366,4358251..4358317,4358533..4358654,4358824..4358862,4358950..4359072,4359173..4359292,4359393..4359534,4359719..4359898,4359984..4360136,4360226..4360332,4360425..4360515,4360614..4360755,4360886..4361042,4361132..4361248,4361333..4361459,4361560..4361796)
Gene LOC18611426
GeneID 18611426
Organism Theobroma cacao

Protein

Length 677aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018115155.1
Definition PREDICTED: probable rhamnogalacturonate lyase B [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category S
Description Rhamnogalacturonate lyase
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K18195        [VIEW IN KEGG]
EC 4.2.2.23        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGGAGAAGGTGGTGAACAGAAGCCACTTGAGACAGCTTGTTTTCTGGCTACTAACGATCATTGAGTTATTACTCCTTCTATCTGCTTCTTCTGAACAACTTTCAGCCAGAAAACTCCTAAGAGATGACACCACTGACCCTCTTCAAGTCCAGCTGGAAACAAGTGATGATAACAGAGTGGTTATTGATAATGGTCTTGTTCAAGTCACTATCGGAAATCCAAATGGTTATCTGATAGGATTAAAATATAAGGGATTTGATAATGTGCTTGAAAGTAGAAACAAGGACCAAAATAAAGGGTACTGGGACATTGTCTGGGATGACAATGCAATTGACAAAATAGAAACAGAACAATTCAAGGTCATAACACAAACTGACGATTTAGTGGAGCTTTCCTTCAGCAAAACTTGGAATTATACAACAGACTATCGCAAGGCAGTTCCCTTAAACATAGACAAAAGGTACATAGTCCGCCGAGGCGTTTCCGGGGTCTACATGTACGGTATCTTCGAGCGCCTGGCCGAATTTCCAGCTGCTCAAATGTACCAAATACGGATTGCTTTCAAGCTCCAGGAAGACAAGTTTCGGTTCATGGCGTTATCAGACACAAGACAAAGGGTCATGCCCATGGGACGAGATCGAAACTCTGATCGCAGTCAGACTCTTGCCTTCAAAGAAGCTGTTCTATTGACCAATCCAACCGATCCAAGACTTAAAGGAGAGGTCGATGACAAGTACCAATATTCTTGTGAAAACAAAGATAACAAGCTTCATGGGTGGATAGCCGAACCCGACGATAATCCAGCCGTGGGTTTCTGGGTAATCACCCCGAGCAATGAGTTCCGTACTGGTGGCCCTCACCATCAGGACCTCACTTCTCATGCTGGTCCCACCGCTCTCTCCATGTTTGTTAGTACGCATTATGCCGGAAAGGACATAGAAACGTCTTATAACGAAGGAGAGCCTTGGAAAAAGGTCTTTGGTCCTGTTCTAATCTATCTTAATTCTGCTTCCAAAGATGCTCGCAAGACACTCTGGGAAGACGCTAAACGACAGTTGAATCAAGAAATTGAAAGCTGGCCGTACAATTTCACTGGATCAGAAGATTTTCCTAATGCTGATGGACGAGGAAAAGTTAGTGGTCAATTACTAGTGCGAGACCGATACATGGATAATGAATTGATGCAGGCGCAATCTGCCTTTGTGGGGTTGGCACCACCTGGTGAGGCAGGATCATGGCAAACAGAAGGAAAGGGCTATCAATTCTGGACTCAAACCGACGAGAATGGCCGTTTCAAAATAGAAAATGTTCGACCAGGGGAGTATAATTTGTATGCATGGGTCCCTGGTTTCATTGGAAACTACAAATCAGATCTCAACATTACTATCGAACCAGGAAAAGACATCAAGTTGGGTACTCTTATATATGATCCTCCAAGAAATGGTCCGACATTGTGGGAAATCGGGATTCCTGACAGGACAGCTGCCGAGTTCTACGTACCGGATCCATACCCAACACTTATGAACTCAATATGCATCGATGACGTAGACAGATATAGACAATACGGATTGTGGGAACGCTACTCAGATATTTATCGTCACGGTGATCTTGTCTACACTGTGGGTGTTAGTAATTATTCTCGGGATTGGTTCTATGCTCATGTTACCAGGGATGGAGGGAACAGTACAAAGCGGCCAACCACATGGCAGATTAAGTACAATCTCGAAGATGTGAGCGAGACAGGAAATTACACTCTCCAATTGGCCTTGGCATCAGCTTCTTATGCTGAAGTACAGGTTCGATTCAACTATTCAGATTCTGACCGACCTTATTTTACAACGAGGCTAATAGGCAGCGATAATGCCGTAGCAAGGCATGGAATTCATGGATTATACAGATTGTATAGTATTATTGTACCTGGTAATCAATTTCAGAAAGGGGAAAATAAAATATTTCTCAGTCAGACAAGAAGCACGGGCGCATTCGACTCAGTTATGTATGACTACATTCGATTAGAAGGACCAACAAGTTAA
Protein:  
MEKVVNRSHLRQLVFWLLTIIELLLLLSASSEQLSARKLLRDDTTDPLQVQLETSDDNRVVIDNGLVQVTIGNPNGYLIGLKYKGFDNVLESRNKDQNKGYWDIVWDDNAIDKIETEQFKVITQTDDLVELSFSKTWNYTTDYRKAVPLNIDKRYIVRRGVSGVYMYGIFERLAEFPAAQMYQIRIAFKLQEDKFRFMALSDTRQRVMPMGRDRNSDRSQTLAFKEAVLLTNPTDPRLKGEVDDKYQYSCENKDNKLHGWIAEPDDNPAVGFWVITPSNEFRTGGPHHQDLTSHAGPTALSMFVSTHYAGKDIETSYNEGEPWKKVFGPVLIYLNSASKDARKTLWEDAKRQLNQEIESWPYNFTGSEDFPNADGRGKVSGQLLVRDRYMDNELMQAQSAFVGLAPPGEAGSWQTEGKGYQFWTQTDENGRFKIENVRPGEYNLYAWVPGFIGNYKSDLNITIEPGKDIKLGTLIYDPPRNGPTLWEIGIPDRTAAEFYVPDPYPTLMNSICIDDVDRYRQYGLWERYSDIYRHGDLVYTVGVSNYSRDWFYAHVTRDGGNSTKRPTTWQIKYNLEDVSETGNYTLQLALASASYAEVQVRFNYSDSDRPYFTTRLIGSDNAVARHGIHGLYRLYSIIVPGNQFQKGENKIFLSQTRSTGAFDSVMYDYIRLEGPTS